Picture for Shaoteng Liu

Shaoteng Liu

LongLive-2.0: An NVFP4 Parallel Infrastructure for Long Video Generation

Add code
May 19, 2026
Viaarxiv icon

Masked Generative Transformer Is What You Need for Image Editing

Add code
May 11, 2026
Viaarxiv icon

Rolling Sink: Bridging Limited-Horizon Training and Open-Ended Testing in Autoregressive Video Diffusion

Add code
Feb 08, 2026
Viaarxiv icon

Both Semantics and Reconstruction Matter: Making Representation Encoders Ready for Text-to-Image Generation and Editing

Add code
Dec 19, 2025
Viaarxiv icon

EditMGT: Unleashing Potentials of Masked Generative Transformers in Image Editing

Add code
Dec 12, 2025
Viaarxiv icon

Training-Free Efficient Video Generation via Dynamic Token Carving

Add code
May 22, 2025
Figure 1 for Training-Free Efficient Video Generation via Dynamic Token Carving
Figure 2 for Training-Free Efficient Video Generation via Dynamic Token Carving
Figure 3 for Training-Free Efficient Video Generation via Dynamic Token Carving
Figure 4 for Training-Free Efficient Video Generation via Dynamic Token Carving
Viaarxiv icon

Generative Video Propagation

Add code
Dec 27, 2024
Figure 1 for Generative Video Propagation
Figure 2 for Generative Video Propagation
Figure 3 for Generative Video Propagation
Figure 4 for Generative Video Propagation
Viaarxiv icon

Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models

Add code
Mar 27, 2024
Figure 1 for Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models
Figure 2 for Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models
Figure 3 for Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models
Figure 4 for Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models
Viaarxiv icon

RL-GPT: Integrating Reinforcement Learning and Code-as-policy

Add code
Feb 29, 2024
Figure 1 for RL-GPT: Integrating Reinforcement Learning and Code-as-policy
Figure 2 for RL-GPT: Integrating Reinforcement Learning and Code-as-policy
Figure 3 for RL-GPT: Integrating Reinforcement Learning and Code-as-policy
Figure 4 for RL-GPT: Integrating Reinforcement Learning and Code-as-policy
Viaarxiv icon

Direct Inversion: Boosting Diffusion-based Editing with 3 Lines of Code

Add code
Oct 19, 2023
Figure 1 for Direct Inversion: Boosting Diffusion-based Editing with 3 Lines of Code
Figure 2 for Direct Inversion: Boosting Diffusion-based Editing with 3 Lines of Code
Figure 3 for Direct Inversion: Boosting Diffusion-based Editing with 3 Lines of Code
Figure 4 for Direct Inversion: Boosting Diffusion-based Editing with 3 Lines of Code
Viaarxiv icon